Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition System

نویسنده

  • Sanghamitra V. Arora
چکیده

In this paper, improvement of an ASR system for Hindi language, based on Vector quantized MFCC as feature vectors and HMM as classifier, is discussed. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is to create delta and delta-delta coefficients and append them to MFCC to create feature vector. This paper focuses on all digits in Hindi (Zero to Nine), which is based on isolated word structure. Performance of the system is evaluated by accurate Recognition Rate (RR). The effect of the combination of the Delta MFCC (DMFCC) feature along with the Delta-Delta MFCC (DDMFCC) feature shows approximately 2.5% further improvement in the RR, with no additional computational costs involved. RR of the system for the speakers involved in the training phase is found to give better recognition accuracy than that for the speakers who were not involved in the training phase. Word wise RR is observed to be good in some digits with distinct phones.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integration of articulatory dynamic parameters in HMM/BN based speech recognition system

In this paper, we describe several approaches to integration of the articulatory dynamic parameters along with articulatory position data into a HMM/BN model based automatic speech recognition system. This work is a continuation of our previous study, where we have successfully combined speech acoustic features in form of MFCC with articulatory position observations. Articulatory dynamic parame...

متن کامل

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

Speaker Dependent Speaker Recognition Using Svm and Hmm

Speaker recognition is the process of recognizing the speaker based on characteristics such as pitch, tone in the speech wave.Background noise influences the overall efficiency of speaker recognition system and is still considered as one of the most challenging issue in Speaker Recognition System (SRS). Support Vector Machine (SVM) and Hidden Markov Model (HMM) are widely used techniques for sp...

متن کامل

Optimization of Features Parameters for HMM Phoneme Recognition of TIMIT Corpus

Phoneme is the smallest contrastive unit in the sound system of a language. Moreover, it has a meaningful role in speech recognition. In this study, we are interesting for phonemes recognition of Timit database using HTK toolkit for HMM. The main goal is to determine the optimal parameters for the recognizer. For this reason, different speech analysis techniques were operated such as Mel Freque...

متن کامل

Comparison and combination of RASTA-PLP and FF features in a hybrid HMM/MLP speech recognition system

Recently, the advantages of the spectral parameters obtained by frequency filtering (FF) of the logarithmic filter bank energies (logFBEs) have been reported. These parameters, which are frequency derivatives of the logFBEs, lie in the frequency domain, and have shown good recognition performance with respect to the conventional mel-frequency cepstral coefficients (MFCC) for HMM systems. In thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013